THE WALSH MODEL FOR $M_2^*$ CARLESON

1. Introduction



CIPRIAN DEMETER, MICHAEL LACEY, TERENCE TAO, AND CHRISTOPH THIELE 
Abstract. We study the Walsh model of a certain maximal truncation of Carleson's 


Let $\mathcal{D}$ denote the collection of all the dyadic intervals of the form $[2^i m, 2^i(m+1))$, $i, m \in \mathbb{Z}$, and let $\mathcal{C}_{\mathcal{D}}(\mathbb{R}_+)$ be the set of all the functions $f : \mathbb{R}_+ \to \mathbb{R}$ that are finite linear combinations of characteristic functions of dyadic intervals.

For $l \ge 0$ we recall that the $l$-th Walsh function $W_l$ is defined recursively by the formula

$$W_0 = 1_{[0,1)},$$
$$W_{2l}(x) = W_l(2x) + W_l(2x-1),$$
$$W_{2l+1}(x) = W_l(2x) - W_l(2x-1).$$

We recognize that $W_1$ is the Haar function, also denoted by $h$.
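The recursion above can be evaluated directly. The following is a minimal Python sketch, not part of the original argument; the function names `walsh` and `sample` are hypothetical, and sampling at cell midpoints is an assumption made only so that values can be compared on a finite grid.

```python
def walsh(l: int, x: float) -> int:
    """Value of the l-th Walsh function W_l at x, via the recursion in the text."""
    if l < 0:
        raise ValueError("the index l must be non-negative")
    if not 0.0 <= x < 1.0:
        return 0  # every W_l is supported on [0, 1)
    if l == 0:
        return 1
    half, odd = divmod(l, 2)
    # W_{2l}(x) = W_l(2x) + W_l(2x-1),  W_{2l+1}(x) = W_l(2x) - W_l(2x-1)
    sign = -1 if odd else 1
    return walsh(half, 2 * x) + sign * walsh(half, 2 * x - 1)


def sample(l: int, depth: int) -> list:
    """Values of W_l at the midpoints of the 2**depth dyadic cells of [0, 1)."""
    n = 2 ** depth
    return [walsh(l, (j + 0.5) / n) for j in range(n)]
```

For instance, `sample(1, 1)` returns the two values of the Haar function on the left and right halves of $[0,1)$, and the vectors `sample(l, 3)` for $0 \le l < 8$ are pairwise orthogonal.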

Definition 1.1. A tile $P$ is a rectangle $I_P \times \omega_P$ of area one, such that $I_P$ and $\omega_P$ are dyadic intervals. If $P = [2^i n, 2^i(n+1)) \times [2^{-i} l, 2^{-i}(l+1))$ is such a tile, we define the corresponding Walsh wave packet $w_P$ by

$$w_P(x) = 2^{-i/2} W_l(2^{-i} x - n).$$

The intervals $I_P$ and $\omega_P$ will be referred to as the time and frequency intervals of the tile $P$.
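As an illustration of Definition 1.1, the sketch below evaluates a wave packet from the triple $(i, n, l)$ and approximates inner products by a Riemann sum. The names `wave_packet` and `inner_product`, and the choice of grid, are assumptions of this sketch; exact rational arithmetic is used for the argument of $W_l$ so that dyadic points are handled without rounding.

```python
from fractions import Fraction


def walsh(l: int, x: Fraction) -> int:
    """W_l(x) from the recursion for W_{2l} and W_{2l+1}; zero outside [0, 1)."""
    if not 0 <= x < 1:
        return 0
    if l == 0:
        return 1
    half, odd = divmod(l, 2)
    return walsh(half, 2 * x) + (-1 if odd else 1) * walsh(half, 2 * x - 1)


def wave_packet(i: int, n: int, l: int, x) -> float:
    """w_P(x) = 2^{-i/2} W_l(2^{-i} x - n) for the tile with parameters (i, n, l)."""
    scaled = Fraction(x) / Fraction(2) ** i - n
    return 2.0 ** (-i / 2) * walsh(l, scaled)


def inner_product(p, q, lo: int, hi: int, depth: int = 4) -> float:
    """Riemann sum of w_p * w_q over [lo, hi) on cells of length 2**-depth.

    This equals the integral (up to rounding) when both packets are
    constant on those cells, which holds for the small examples below.
    """
    step = Fraction(1, 2 ** depth)
    total = 0.0
    x = lo + step / 2
    while x < hi:
        total += wave_packet(*p, x) * wave_packet(*q, x)
        x += step
    return total * float(step)
```

With these helpers one can check numerically that a wave packet has $L^2$ norm one and that packets of two disjoint tiles are orthogonal.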



Definition 1.2. A bitile $P$ is a rectangle $I_P \times \omega_P$ of area two, such that $I_P$ and $\omega_P$ are dyadic intervals. For any bitile

$$P = [2^i n, 2^i(n+1)) \times [2^{-i+1} l, 2^{-i+1}(l+1))$$

we define the lower tile

$$P_1 = [2^i n, 2^i(n+1)) \times [2l \cdot 2^{-i}, (2l+1) 2^{-i})$$

and the upper tile

$$P_2 = [2^i n, 2^i(n+1)) \times [(2l+1) 2^{-i}, (2l+2) 2^{-i}).$$

If $\omega_P$ is the frequency interval of the bitile $P$ then we will use the notations $\omega_{P,1}$ and $\omega_{P,2}$ for the frequency intervals of the sub-tiles $P_1$ and $P_2$.
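The splitting of a bitile into its lower and upper tiles is simple interval arithmetic. The following sketch uses exact fractions; the names `Rect`, `bitile`, `split` and `area` are hypothetical helpers introduced only for this illustration.

```python
from fractions import Fraction
from typing import NamedTuple, Tuple


class Rect(NamedTuple):
    time: Tuple[Fraction, Fraction]  # endpoints of the half-open interval I_P
    freq: Tuple[Fraction, Fraction]  # endpoints of the half-open interval omega_P


def bitile(i: int, n: int, l: int) -> Rect:
    """The bitile [2^i n, 2^i (n+1)) x [2^{-i+1} l, 2^{-i+1} (l+1))."""
    scale = Fraction(2) ** i
    return Rect((scale * n, scale * (n + 1)), (2 * l / scale, 2 * (l + 1) / scale))


def split(i: int, n: int, l: int) -> Tuple[Rect, Rect]:
    """Lower tile P_1 and upper tile P_2 of the bitile with parameters (i, n, l)."""
    scale = Fraction(2) ** i
    time = (scale * n, scale * (n + 1))
    lower = Rect(time, (2 * l / scale, (2 * l + 1) / scale))
    upper = Rect(time, ((2 * l + 1) / scale, (2 * l + 2) / scale))
    return lower, upper


def area(rect: Rect) -> Fraction:
    return (rect.time[1] - rect.time[0]) * (rect.freq[1] - rect.freq[0])
```

The two tiles returned by `split` share the time interval of the bitile, have area one each, and their frequency intervals are the left and right halves of $\omega_P$.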
We next recall the definition of the Walsh-Fourier transform. Except on a set of measure 0 (which we shall always ignore), every $x \in \mathbb{R}_+$ can be identified with a doubly-infinite set of binary digits

$$x = \ldots a_2 a_1 a_0 . a_{-1} a_{-2} \ldots$$

where $a_n \in \mathbb{Z}_2$ and $a_n$ is eventually zero as $n \to \infty$. We define two operations on $\mathbb{R}_+$ by

$$a_n(x \oplus y) := a_n(x) + a_n(y),$$

$$a_n(x \otimes y) := \sum_{m \in \mathbb{Z}} a_m(x) a_{n-m}(y),$$

where the addition and multiplication in the right hand terms are considered modulo 2. We next define the function $e : \mathbb{R}_+ \to \{-1, 1\}$ to be $1$ when $a_{-1} = 0$ and $-1$ when $a_{-1} = 1$. Using this we can introduce the Walsh-Fourier transform of a function $f \in \mathcal{C}_{\mathcal{D}}(\mathbb{R}_+)$ to be

$$\hat f(\xi) := \int e(x \otimes \xi) f(x)\, dx.$$

We also note that the inverse Walsh-Fourier transform $f^\vee$ and the Walsh-Fourier transform coincide in this context.
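The two digit-wise operations can be made concrete for dyadic rationals with finitely many binary digits. The sketch below is an illustration only: the fixed precision `K` and the helper names (`to_fixed`, `digit`, `dyadic_add`, `dyadic_mul_digit`, `e`) are assumptions of this sketch, and inputs with more than `K` fractional digits are rejected rather than rounded.

```python
K = 12  # number of fractional binary digits kept; an assumption of this sketch


def to_fixed(x: float) -> int:
    """Encode a non-negative dyadic rational x as the integer x * 2**K."""
    scaled = x * 2 ** K
    if x < 0 or scaled != int(scaled):
        raise ValueError("x must be a non-negative dyadic rational with at most K fractional digits")
    return int(scaled)


def digit(X: int, n: int) -> int:
    """a_n(x): the coefficient of 2**n in the binary expansion of x = X / 2**K."""
    pos = n + K
    return (X >> pos) & 1 if pos >= 0 else 0


def dyadic_add(x: float, y: float) -> float:
    """x (+) y: digit-wise addition modulo 2, i.e. bitwise XOR of the expansions."""
    return (to_fixed(x) ^ to_fixed(y)) / 2 ** K


def dyadic_mul_digit(x: float, y: float, n: int) -> int:
    """a_n(x (x) y) = sum over m of a_m(x) * a_{n-m}(y), taken modulo 2."""
    X, Y = to_fixed(x), to_fixed(y)
    parity = 0
    for m in range(-K, X.bit_length() - K):
        parity ^= digit(X, m) & digit(Y, n - m)
    return parity


def e(x: float, xi: float) -> int:
    """e(x (x) xi): +1 if the digit a_{-1} of the product is 0, and -1 otherwise."""
    return -1 if dyadic_mul_digit(x, xi, -1) else 1
```

Since the digit $a_{-1}(x \otimes \xi)$ is additive modulo 2 in the digits of $x$, the function $x \mapsto e(x \otimes \xi)$ is multiplicative with respect to $\oplus$; for an integer $\xi = l$ and $x \in [0,1)$ it reproduces the values of $W_l(x)$ in the small cases checked here.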

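In the following we will denote by $S_{\mathrm{univ}}$ the collection of all the bitiles. It is known, see [6], that the almost everywhere convergence of the Walsh series for $f \in L^p$

$$\sum_{l \ge 0} \langle f, W_l \rangle W_l(x)$$

is a consequence of the estimate

$$\|Wf\|_p \lesssim \|f\|_p,$$

where

$$Wf(x) = \Big\| \sum_{P \in S_{\mathrm{univ}}} \langle f, w_{P_1} \rangle w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big\|_{L^\infty_\theta}.$$

Define the $M_2^*$ norm of a family of Walsh multipliers $m_k$ as

$$\|(m_k(\theta))_{k \in \mathbb{Z}}\|_{M_2^*(\theta)} = \sup_{\|g\|_2 = 1} \Big\| \sup_k |(\hat g\, m_k)^\vee(y)| \Big\|_{L^2_y}.$$

In this paper we will be concerned with getting estimates for the operator

$$W_{\max} f(x) = \Big\| \Big( \sum_{P \in S_{\mathrm{univ}} : |I_P| < 2^k} \langle f, w_{P_1} \rangle w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)}.$$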

Theorem 1.3. For each $1 < p < \infty$ we have

$$\|W_{\max} f\|_p \lesssim_p \|f\|_p. \quad (1)$$

It has been acknowledged, see for example [4], [7], that the Walsh models provide a lot 
of the intuition that lies behind their Fourier analog. In our case, the interest in proving 
Theorem 1.3 is motivated by its connections with the following Return Times Theorem 
due to Bourgain [2]. 
Theorem 1.4. Let $\mathbf{X} = (X, \Sigma, \mu, \tau)$ be a dynamical system and let $1 \le p, q \le \infty$ satisfy $\frac{1}{p} + \frac{1}{q} \le 1$. For each function $f \in L^p(X)$ there is a universal set $X_0 \subseteq X$ with $\mu(X_0) = 1$, such that for each second dynamical system $\mathbf{Y} = (Y, \mathcal{F}, \nu, \sigma)$, each $g \in L^q(Y)$ and each $x \in X_0$, the averages

$$\frac{1}{N} \sum_{n=0}^{N-1} f(\tau^n x) g(\sigma^n y)$$

converge $\nu$-almost everywhere.

In [3] we extend Bourgain's theorem to a larger range of $p$ and $q$. Our argument there relies on estimates like the one in Theorem 1.3 for a model operator which is the Fourier counterpart of $W_{\max}$. We hope that our presentation here for the simpler Walsh model will ease the understanding of the proof in [3].

We note that in order to prove Theorem 1.3 it suffices to assume that the summation in the definition of the operator $W_{\max}$ runs over a finite collection $S \subset S_{\mathrm{univ}}$ of bitiles, and to prove inequality (1) with bounds independent of $S$. We fix the collection $S$ for the remaining part of the paper.

The argument relies on first splitting the collection of bitiles into structured collections called trees. The bitiles in each tree give rise to a modulated Littlewood-Paley decomposition. The model operator $W_{\max}$ restricted to each such tree is estimated in Section 3, by using the Calderón-Zygmund-type estimates from Section 2.

In Section 5 the operator $W_{\max} f(x)$ is estimated pointwise, and it is shown that for each $x$ the contribution to $W_{\max} f(x)$ comes from one stack of trees. Crucial to estimating this contribution is a weighted version of a maximal multiplier result due to Bourgain. This is proved in Section 4. The different pieces of the proof are put together in the last section of this paper.



2. Variational norm estimates for averages 

Let $H$ be a separable Hilbert space equipped with a norm $|\cdot|_H$ and denote by $L^q(\mathbb{R}, H)$ the measurable functions on $\mathbb{R}$ with values in $H$ whose $q$-th power is integrable. Let $E(f|\mathcal{D}_k)$ denote the conditional expectation with respect to the $\sigma$-algebra on $\mathbb{R}$ generated by the dyadic intervals of length $2^k$. We include the case $k = \infty$ by setting $E(f|\mathcal{D}_\infty) = 0$. From now on we will use the notation

$$g_I(x) = \frac{1}{|I|^{1/2}}\, g\Big(\frac{x - l(I)}{|I|}\Big)$$

for each dyadic interval $I = [l(I), r(I))$.
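For a function that is constant on unit cells, the conditional expectation $E(f|\mathcal{D}_k)$ with $k \ge 0$ is just a block average. The following sketch makes this explicit; the function name `dyadic_averages` and the restriction to a finite list of samples whose length is a multiple of $2^k$ are assumptions of this illustration.

```python
def dyadic_averages(values, k: int):
    """E(f | D_k) for f given by its values on consecutive unit cells.

    Each block of 2**k consecutive cells is replaced by its mean, so the
    output is constant on dyadic intervals of length 2**k.
    """
    block = 2 ** k
    if k < 0 or len(values) % block != 0:
        raise ValueError("need k >= 0 and a length divisible by 2**k")
    out = []
    for start in range(0, len(values), block):
        mean = sum(values[start:start + block]) / block
        out.extend([mean] * block)
    return out
```

Averaging first at scale $2^1$ and then at scale $2^2$ gives the same result as averaging at scale $2^2$ directly, which is the tower property behind the martingale structure used in this section.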

Lemma 2.1 (Jump inequality). Consider $1 < q < \infty$ and $f \in L^q(\mathbb{R}, H)$. For each $x$ and $\lambda > 0$ define the entropy number $M_\lambda(x)$ to be the maximal length of a chain $\infty = k_0 > k_1 > k_2 > \cdots > k_{M_\lambda(x)}$ such that for each $1 \le m \le M_\lambda(x)$

$$|E(f|\mathcal{D}_{k_m})(x) - E(f|\mathcal{D}_{k_{m-1}})(x)|_H > \lambda.$$

Then

$$\|\lambda M_\lambda^{1/2}(x)\|_{L^q_x} \le C_q \|f\|_{L^q(\mathbb{R},H)}$$

where the constant $C_q$ remains bounded for $q$ in any compact subinterval of $(1, \infty)$.
Proof. This result is well known; we briefly sketch the proof for completeness. First we establish that the number $M_\lambda(x)$ of $\lambda$-jumps can be estimated by counting the $\lambda/2$-jumps in a greedy algorithmic way. Let $k_0(x) = \infty$ and for $m \ge 1$ let $k_m(x)$ be the largest number $k < k_{m-1}(x)$, if it exists, such that $|E(f|\mathcal{D}_{k})(x) - E(f|\mathcal{D}_{k_{m-1}(x)})(x)|_H > \lambda/2$. Let $\tilde M_\lambda(x)$ be the maximal index $m$ for which $k_m(x)$ exists. Define

$$A_x = \{k_0(x), k_1(x), \ldots, k_{\tilde M_\lambda(x)}(x)\},$$

$$\mathcal{I}_x = \{J \in \mathcal{D} : x \in J,\ |J| = 2^k \text{ for some } k \in A_x\}.$$

Then one easily checks that $M_\lambda(x) \le \tilde M_\lambda(x)$. The crucial additional property of this greedy selection is that the initial parts of the sequence $k_m$ coincide for two nearby values of $x$ until the value of $2^{k_m}$ gets smaller than the length of the smallest dyadic interval containing both values.
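The comparison between the optimal chain of $\lambda$-jumps and the greedy count of $\lambda/2$-jumps can be checked on finite sequences. In the sketch below the list `g` plays the role of the values $E(f|\mathcal{D}_k)(x)$ listed from coarse to fine scales, with `g[0]` standing for the scale $k_0 = \infty$; this ordering, the scalar values, and the names `max_chain_jumps` and `greedy_jumps` are assumptions of the illustration.

```python
def max_chain_jumps(g, lam):
    """Largest M such that there are indices 0 = k_0 < k_1 < ... < k_M
    with |g[k_m] - g[k_{m-1}]| > lam for every m (dynamic programming)."""
    best = [None] * len(g)  # best[i]: longest admissible chain ending at index i
    best[0] = 0
    for i in range(1, len(g)):
        reachable = [best[j] + 1 for j in range(i)
                     if best[j] is not None and abs(g[i] - g[j]) > lam]
        best[i] = max(reachable) if reachable else None
    return max(b for b in best if b is not None)


def greedy_jumps(g, lam):
    """Number of greedy jumps: move the anchor as soon as the value differs
    from the current anchor by more than lam / 2."""
    anchor, count = g[0], 0
    for value in g[1:]:
        if abs(value - anchor) > lam / 2:
            anchor, count = value, count + 1
    return count
```

Between two consecutive greedy indices all values stay within $\lambda/2$ of the anchor, so no $\lambda$-jump can occur inside such a block; this is why `max_chain_jumps(g, lam)` never exceeds `greedy_jumps(g, lam)` in the examples tried.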

For each $x$ and each selected interval $J \in \mathcal{I}_x$, let $\mathcal{I}_J$ be the collection of dyadic intervals contained in $J$ but not contained in any interval from $\mathcal{I}_x$ of length smaller than $|J|$. By vector valued Calderón-Zygmund theory we have

$$\Big\| \sum_{J \in \mathcal{I}_x} \sum_{I \in \mathcal{I}_J} \epsilon_J \langle f, h_I \rangle h_I(x) \Big\|_{L^q_x(\mathbb{R},H)} \le C_q \|f\|_{L^q(\mathbb{R},H)} \quad (2)$$

uniformly in all choices of signs $\epsilon_J \in \{-1, 1\}$. For $q = 2$ this is an easy Hilbert space argument using orthogonality of the functions $h_I$. For $q < 2$ we use a Calderón-Zygmund decomposition of $|f|$ to obtain a weak endpoint at $q = 1$ and then interpolate. For $q > 2$ we use BMO techniques, i.e., we estimate the sharp maximal function

$$g^\#(x) = \sup_{x \in I} \Big( \frac{1}{|I|} \int_I |g - g_I|^2 \Big)^{1/2} = \sup_{x \in I} \Big( \frac{1}{|I|} \int_I |g|^2 - |g_I|^2 \Big)^{1/2}$$

(where, in this formula only, $g_I$ stands for the average of $g$ over $I$) of the function $g$ on the left hand side of (2) by the maximal function of $|f|$, and then use standard $L^q$ bounds for the sharp function and the maximal function.
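Inequality (2) implies

$$\Big\| \Big( \sum_{1 \le m \le \tilde M_\lambda(x)} |E(f|\mathcal{D}_{k_m(x)})(x) - E(f|\mathcal{D}_{k_{m-1}(x)})(x)|_H^2 \Big)^{1/2} \Big\|_{L^q_x} \le C_q \|f\|_{L^q(\mathbb{R},H)}$$

and using that all jumps are at least $\lambda/2$ proves the lemma. □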

Define the $r$-variational norm of a sequence $g_k$ of elements in $H$ to be

$$\|g_k\|_{V^r(k)} := \sup_k |g_k|_H + \sup_{M, k_0, k_1, \ldots, k_M} \Big( \sum_{m=1}^{M} |g_{k_m} - g_{k_{m-1}}|_H^r \Big)^{1/r}.$$

One may also define some "weak" variational norm

$$\|g_k\|_{V^{r,\infty}(k)} := \sup_k |g_k|_H + \sup_{\lambda > 0} \lambda M_\lambda^{1/r},$$

where $M_\lambda$ is the maximal number $M$ of indices $k_0, k_1, \ldots, k_M$ such that $|g_{k_m} - g_{k_{m-1}}|_H \ge \lambda$ for all $1 \le m \le M$. We have the usual estimate for the $V^r(k)$ norm in terms of $M_\lambda$:

$$\|g_k\|_{V^r(k)} \lesssim \|g_k\|_\infty + \Big( \int_0^\infty \lambda^r M_\lambda \frac{d\lambda}{\lambda} \Big)^{1/r}.$$
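Both norms can be computed exactly for a finite scalar sequence, which may help in comparing them. The sketch below is an illustration under stated assumptions: the sequence is real-valued and finite, chains are taken over increasing indices, and the function names are hypothetical. The supremum over $\lambda$ in the weak norm is evaluated only at the finitely many values $|g_i - g_j|$, where the step function $\lambda \mapsto M_\lambda$ can change.

```python
def variation_norm(g, r):
    """sup_k |g_k| plus the supremum over index chains of
    (sum_m |g_{k_m} - g_{k_{m-1}}|^r)^(1/r), via dynamic programming."""
    best = [0.0] * len(g)  # best[i]: largest sum of r-th powers over chains ending at i
    for i in range(len(g)):
        for j in range(i):
            best[i] = max(best[i], best[j] + abs(g[i] - g[j]) ** r)
    return max(abs(v) for v in g) + max(best) ** (1.0 / r)


def jump_count(g, lam):
    """M_lambda: the largest number of consecutive jumps of size >= lam in a chain."""
    best = [0] * len(g)
    for i in range(len(g)):
        for j in range(i):
            if abs(g[i] - g[j]) >= lam:
                best[i] = max(best[i], best[j] + 1)
    return max(best)


def weak_variation_norm(g, r):
    """sup_k |g_k| + sup over lam > 0 of lam * M_lam^(1/r)."""
    thresholds = {abs(a - b) for a in g for b in g if a != b}
    weak = max((lam * jump_count(g, lam) ** (1.0 / r) for lam in thresholds), default=0.0)
    return max(abs(v) for v in g) + weak
```

Since a chain of $M$ jumps of size at least $\lambda$ contributes at least $\lambda M^{1/r}$ to the strong norm, `weak_variation_norm` is never larger than `variation_norm` on the same input.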



The jump inequality in Lemma 2.1 is almost a $V^{2,\infty}$ inequality, with the difference that in that inequality $\lambda$ is independent of $x$, while in an honest $V^{2,\infty}$ inequality the parameter $\lambda$ may be maximized at every $x$ individually. Hence the jump inequality is somewhat weaker than a $V^{2,\infty}$ inequality. By integrating over all $\lambda$ and using Fubini one can remove this disadvantage of $\lambda$ being constant in $x$ and prove honest $V^r(k)$ norm estimates with $r > 2$.

Lemma 2.2 (Variational estimate). Let $1 < q < \infty$ and $f \in L^q(\mathbb{R}, H)$. Then for $2 < r < \infty$ we have

$$\big\| \|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} \big\|_{L^q_x} \le C_q (1 + (r-2)^{-1}) \|f\|_{L^q(\mathbb{R},H)}$$

where $C_q$ remains bounded on any compact interval of $(1, \infty)$.

Proof. For each $x$ and $\lambda > 0$ we denote by $M_\lambda(x)$ the entropy number of the collection $\{E(f|\mathcal{D}_k)(x) : k \in \mathbb{Z}\}$. We first consider this inequality for $|f|$ being the characteristic function of a set $A$. Then $M_\lambda = 0$ for $\lambda > 1$. Hence we can write for $2 < r < \infty$

$$\|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} \le C \Big( \int_0^1 \lambda^2 M_\lambda(x)\, \lambda^{r-2} \frac{d\lambda}{\lambda} \Big)^{1/r}.$$

The right hand term is an $L^r_\lambda(d\mu)$ norm of $(\lambda^2 M_\lambda)^{1/r}(x)$ with respect to an appropriate measure space of total mass $\|\mu\| = \int_0^1 \lambda^{r-3}\, d\lambda = (r-2)^{-1}$.
In the case $q = r$ we get

$$\big\| \|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} \big\|_{L^r_x} \le C \big\| \|(\lambda^2 M_\lambda(x))^{1/r}\|_{L^r_\lambda(d\mu)} \big\|_{L^r_x}$$

$$= C \big\| \|(\lambda^2 M_\lambda(x))^{1/r}\|_{L^r_x} \big\|_{L^r_\lambda(d\mu)}$$

$$\le C \big\| |A|^{1/r} \big\|_{L^r_\lambda(d\mu)}$$

$$\le C (r-2)^{-1/r} |A|^{1/r}.$$

Here we have used that

$$\int \lambda^2 M_\lambda(x)\, dx \le C |A|$$

from the jump inequality in Lemma 2.1 applied with $q = 2$. We remark that $(r-2)^{-1/r}$ is bounded by $1 + (r-2)^{-1}$.
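The two elementary facts used here, the total mass $\int_0^1 \lambda^{r-3}\, d\lambda = (r-2)^{-1}$ and the bound $(r-2)^{-1/r} \le 1 + (r-2)^{-1}$, can be sanity-checked numerically. The sketch below is only an illustration; the midpoint rule is used for $r \ge 3$, where the integrand is bounded, and the function names are hypothetical.

```python
def total_mass(r: float) -> float:
    """Closed form of the integral of lam**(r-3) over (0, 1), valid for r > 2."""
    if r <= 2:
        raise ValueError("the integral diverges for r <= 2")
    return 1.0 / (r - 2.0)


def midpoint_mass(r: float, steps: int = 2000) -> float:
    """Midpoint-rule approximation of the same integral (intended for r >= 3)."""
    h = 1.0 / steps
    return sum(((j + 0.5) * h) ** (r - 3.0) for j in range(steps)) * h


def holder_factor(r: float) -> float:
    """(r-2)**(-1/r), the factor appearing in the case q = r."""
    return total_mass(r) ** (1.0 / r)
```

For $r$ close to 2 the factor `holder_factor(r)` is dominated by $(r-2)^{-1}$, and for $r \ge 3$ it is at most 1, which is consistent with the remark above.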

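If $q > r$, then we invoke Hölder's inequality

$$\|(\lambda^2 M_\lambda(x))^{1/r}\|_{L^r_\lambda(d\mu)} \le (r-2)^{1/q - 1/r} \|(\lambda^2 M_\lambda(x))^{1/r}\|_{L^q_\lambda(d\mu)}$$

and

$$\int (\lambda^2 M_\lambda(x))^{q/r}\, dx \le C_{2q/r} |A|$$

and then proceed as above to obtain

$$\big\| \|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} \big\|_{L^q_x} \le C_{2q/r} (r-2)^{-1/r} |A|^{1/q}.$$

Observe that $2 < 2q/r < q$, so we can write $C_q$ instead of $C_{2q/r}$.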
If $q < r$, we will prove a weak type inequality

$$m\{x : \|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} > \nu\} \le C_q (1 + (r-2)^{-1})\, \nu^{-q} \|f\|^q_{L^q(\mathbb{R},H)}.$$

Define

$$E = \Big\{x : \sup_{x \in I \in \mathcal{D}} \frac{1}{|I|} \int_I |f|(y)\, dy > \nu \Big\}.$$

Outside $E$, we may replace $f$ by the good part $g$ of the Calderón-Zygmund decomposition of $f$ in order to calculate the value of $E(f|\mathcal{D}_k)$. As usual we have

$$\|g\|^r_{L^r(\mathbb{R},H)} \le C \nu^{r-q} \|f\|^q_{L^q(\mathbb{R},H)}.$$

Hence we have

$$m\{x : \|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} > \nu\} \le |E| + m\{x \in E^c : \|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} > \nu\}$$

$$\le C_q \nu^{-q} \|f\|^q_{L^q(\mathbb{R},H)} + C \nu^{-r} \big\| \|E(f|\mathcal{D}_k)(x)\|_{V^r(k)} \big\|^r_{L^r(E^c)}$$

$$\le C_q (1 + (r-2)^{-1})\, \nu^{-q} \|f\|^q_{L^q(\mathbb{R},H)}.$$

The Lemma now follows by Marcinkiewicz interpolation, passing from restricted weak type to strong type inequalities. □

3. General facts about Walsh time-frequency analysis 

The endpoints of the dyadic intervals will be called dyadic points. For each dyadic interval $\omega = [a, b]$, the subintervals $\omega_1 := [a, \frac{a+b}{2}]$ and $\omega_2 := [\frac{a+b}{2}, b]$ will be referred to as the left and right children of $\omega$, respectively.

Definition 3.1. For two tiles (or bitiles) $P$ and $P'$ we write $P \le P'$ if $I_P \subseteq I_{P'}$ and $\omega_{P'} \subseteq \omega_P$.

Definition 3.2. A tree with top $(I_T, \xi_T)$ is a collection of bitiles $T \subseteq S$ such that $I_P \subseteq I_T$ and $\xi_T \in \omega_P$ for each $P \in T$. An $i$-tree is a tree $T$ such that $\xi_T \in \omega_{P,i}$ for each $P \in T$.
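The order relation and the tree conditions are interval inclusions, so they can be tested mechanically. The sketch below encodes a bitile by the triple $(i, n, l)$ of Definition 1.2 and uses half-open intervals as in Section 1; the class `Bitile` and the helper names are hypothetical and serve only as an illustration.

```python
from fractions import Fraction
from typing import NamedTuple, Optional


class Bitile(NamedTuple):
    i: int  # I_P = [2^i n, 2^i (n+1))
    n: int
    l: int  # omega_P = [2^{-i+1} l, 2^{-i+1} (l+1))


def time_interval(P: Bitile):
    s = Fraction(2) ** P.i
    return (s * P.n, s * (P.n + 1))


def freq_interval(P: Bitile):
    s = Fraction(2) ** (1 - P.i)
    return (s * P.l, s * (P.l + 1))


def contains(outer, inner) -> bool:
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def leq(P: Bitile, Q: Bitile) -> bool:
    """P <= Q: I_P is contained in I_Q and omega_Q is contained in omega_P."""
    return (contains(time_interval(Q), time_interval(P))
            and contains(freq_interval(P), freq_interval(Q)))


def in_tree(P: Bitile, top_interval, xi, which: Optional[int] = None) -> bool:
    """Membership test for a tree with top (I_T, xi_T).

    With which = 1 or 2 the stronger i-tree condition xi_T in omega_{P,i}
    is tested instead of xi_T in omega_P.
    """
    if not contains(top_interval, time_interval(P)):
        return False
    lo, hi = freq_interval(P)
    mid = (lo + hi) / 2
    if which == 1:
        hi = mid
    elif which == 2:
        lo = mid
    return lo <= xi < hi
```

For example, the bitile with $I_P = [0,1)$, $\omega_P = [0,2)$ lies below the bitile with $I_P = [0,2)$, $\omega_P = [0,1)$ in this order, and a top frequency $\xi_T = 3/2$ places the first of them in a 2-tree but not in a 1-tree.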

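Definition 3.3. Fix some $f : \mathbb{R}_+ \to \mathbb{R}$. For a finite subset of bitiles $S' \subseteq S$ define its size relative to $f$ as

$$\mathrm{size}(S') := \sup_T \Big( \frac{1}{|I_T|} \sum_{P \in T} |\langle f, w_{P_1} \rangle|^2 \Big)^{1/2}$$

where the supremum is taken over all the 2-trees $T \subseteq S'$.

We recall a few important results regarding the size.

Proposition 3.4. For each $1 < s < \infty$, each 2-tree $T$ and each $f \in L^s(\mathbb{R}_+)$ we have

$$\Big( \frac{1}{|I_T|} \sum_{P \in T} |\langle f, w_{P_1} \rangle|^2 \Big)^{1/2} \lesssim \inf_{x \in I_T} M_s f(x).$$

Proof. See for example Lemma 1.8.1 in [5]. □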

The following Bessel type inequality, see for example [4], will be used to organize collections of bitiles into trees.

Proposition 3.5. Let $S' \subseteq S$ be a collection of tiles and define $\Delta := [-\log_2(\mathrm{size}(S'))]$, where the size is understood with respect to some function $f \in L^2(\mathbb{R}_+)$. Then $S'$ can be written as a disjoint union $S' = \bigcup_{n \ge \Delta} \mathcal{P}_n$, where $\mathrm{size}(\mathcal{P}_n) \le 2^{-n}$ and each $\mathcal{P}_n$ consists of a family $\mathcal{F}_{\mathcal{P}_n}$ of pairwise disjoint trees satisfying

$$\sum_{T \in \mathcal{F}_{\mathcal{P}_n}} |I_T| \lesssim 2^{2n} \|f\|_2^2 \quad (3)$$

with bounds independent of $S'$, $n$ and $f$.

Elementary computations show that for each tile $P = [2^i n, 2^i(n+1)) \times [2^{-i} l, 2^{-i}(l+1))$, each $l' \ge 0$ and each $\xi \in [2^{-i} l', 2^{-i}(l'+1))$ we have

$$w_P(x) = 1_{I_P}(x)\, e(2^{-i} l \otimes x),$$

$$w_P(x)\, e(\xi \otimes x) = \epsilon(P, \xi)\, 1_{I_P}(x)\, e(2^{-i} |l' - l| \otimes x),$$

where $\epsilon(P, \xi) \in \{-1, 1\}$ depends on $P$ and $\xi$ but not on $x$. In particular, if $T$ is a 2-tree and $P \in T$ then

$$w_{P_1}(x)\, e(\xi_T \otimes x) = \epsilon(P, \xi_T)\, w_{P'}(x)$$

where $P' = [2^i n, 2^i(n+1)) \times [2^{-i}, 2^{-i+1})$, and thus $w_{P_1}(x)\, e(\xi_T \otimes x)$ is constant on both the left half and the right half of $I_P$. An immediate consequence is that for each $k \in \mathbb{Z}$ and each $a_P \in \mathbb{R}$

$$e(\xi_T \otimes x) \sum_{P \in T : |I_P| \ge 2^k} a_P w_{P_1}(x) = E\Big( e(\xi_T \otimes \cdot) \sum_{P \in T} a_P w_{P_1} \Big| \mathcal{D}_{k-1} \Big)(x).$$

Since

$$w_{P_1}(x)\, e(\xi_T \otimes x) = \epsilon(P, \xi_T)\, 2^{-i/2} W_1(2^{-i} x - n) = \epsilon(P, \xi_T)\, h_{I_P}(x)$$

where $h$ is the Haar function, the classical theory of wavelets and John-Nirenberg's inequality imply the following.

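Theorem 3.6. Let $T$ be a 2-tree and assume $(a_P)_{P \in T} \in \mathbb{R}$ satisfy

$$\Big( \frac{1}{|I|} \sum_{P \in T,\ I_P \subseteq I} |a_P|^2 \Big)^{1/2} \le \beta$$

for each dyadic interval $I$. Then for each $1 < s < \infty$

$$\Big\| e(\xi_T \otimes \cdot) \sum_{P \in T} a_P w_{P_1} \Big\|_{BMO} \lesssim \beta$$

and

$$\Big\| e(\xi_T \otimes \cdot) \sum_{P \in T} a_P w_{P_1} \Big\|_{L^s} \lesssim_s \beta\, |I_T|^{1/s}.$$

As an immediate consequence of Theorem 3.6 and of Lemma 2.2 we obtain the following.

Theorem 3.7. Let $T$ be a 2-tree, $f : \mathbb{R}_+ \to \mathbb{R}$ and let $\mathrm{size}(T)$ denote the size of $T$ with respect to the function $f$. Then for each $1 < s < \infty$

$$\Big\| \Big\| \sum_{P \in T : |I_P| < 2^k} a_P w_{P_1}(x) \Big\|_{V^r(k)} \Big\|_{L^s_x} \lesssim \mathrm{size}(T)\, |I_T|^{1/s},$$

where $a_P = \langle f, w_{P_1} \rangle$.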
4. A generalization of a lemma of Bourgain

In this section we generalize a maximal multiplier result due to Bourgain [1]. We begin with the following easy consequence of Minkowski's inequality.

Lemma 4.1. Let $\Xi$ be a finite set. Consider also two sequences $a_k$ and $b_k$ in the Hilbert space $l^2(\Xi)$ and define $a_k * b_k \in l^2(\Xi)$ by $(a_k * b_k)_\xi = (a_k)_\xi (b_k)_\xi$. Then

$$\|a_k * b_k\|_{V^r(k)} \lesssim \Big( \sum_{\xi \in \Xi} \|(a_k)_\xi\|^2_{V^r(k)} \|(b_k)_\xi\|^2_{V^r(k)} \Big)^{1/2}.$$

Proposition 4.2. Let $H$ be a Hilbert space. Assume we are given a set $A$ of linear functionals $f \mapsto f^{(\alpha)} = \langle f, e^{(\alpha)} \rangle$, $\alpha \in A$, $e^{(\alpha)} \in H$, of norm less than $\epsilon$ such that

$$\sum_{\alpha \in A} |f^{(\alpha)}|^2 \le |f|_H^2$$

for each $f \in H$. Set $N = \epsilon^2 |A|$. Let $f_k$ be a sequence of $H$-valued functions on $\mathbb{R}$ such that we have the variational inequality

$$\big\| \|f_k(x)\|_{V^r(k)} \big\|_{L^2_x} \le F.$$

Then we have

$$\Big\| \big\| \sup_k |f_k^{(\alpha)}(x)| \big\|_{l^2(A)} \Big\|_{L^2_x} \le C N^{r/4 - 1/2} F.$$

A special example of a collection of linear functionals as in the proposition can be obtained by choosing the $e^{(\alpha)}$ to be an orthonormal family of vectors and $\epsilon = 1$. Our main application will involve a more general set of linear functionals. We remark that the difficulty in this proposition comes from the fact that we take the supremum in $k$ before we take the square sum of the components.

Proof. Fix $x$ and define $C_x = \{f_k(x)\}$ and $d(x) = \mathrm{diam}(C_x)$. It suffices to prove the Proposition in the case $C_x$ is finite and then to invoke the Monotone Convergence Theorem. Also, we can assume with no loss of generality that $C_x$ contains the origin $0$. For each $\lambda > 0$ denote by $N_\lambda(x)$ the minimum number of balls with radius $\lambda$ and centered at elements of $C_x$, whose union covers $C_x$. It is an easy exercise to prove that

$$\sup_{\lambda > 0} \lambda N_\lambda(x)^{1/r} \lesssim_r \|f_k(x)\|_{V^r(k)}, \quad (4)$$

with the implicit constant depending only on $r$. For each $n > -\log_2(d(x))$, let $C_{n,x}$ be a collection of elements of $(C_x - C_x)$ such that

$$|c|_H \le 2^{-n+2} \text{ for each } c \in C_{n,x},$$

$$\# C_{n,x} \le N_{2^{-n}}(x) + 1,$$

and each $c \in C_x$ can be written as

$$c = \sum_{n > -\log_2(d(x))} c_n \quad \text{with } c_n \in C_{n,x}. \quad (5)$$

Here is how $C_{n,x}$ is constructed. For each $n > -\log_2(d(x))$ define $B_{n,x}$ to be a collection of $N_{2^{-n}}(x)$ elements of $C_x$ such that the balls with centers in $B_{n,x}$ and radius $2^{-n}$ cover $C_x$. If $n = [-\log_2(d(x))] - 1$ define $B_{n,x} = \{0\}$. For each $n > -\log_2(d(x))$ and each $c \in B_{n,x}$, choose an element $c' \in B_{n-1,x}$ such that the ball centered at $c$ and with radius $2^{-n}$ intersects the ball centered at $c'$ and with radius $2^{-n+1}$. Define

$$C_{n,x} := \{c - c' : c \in B_{n,x}\} \cup \{0\}.$$

Since $C_x$ is finite, for each $c \in C_x$ there is $n$ such that $c \in B_{n,x}$. To verify the representation (5) for an arbitrary $c \in C_x$, denote as above by $c'$ the element from $B_{n-1,x}$ associated with $c$, by $c''$ the element from $B_{n-2,x}$ associated with $c'$ and so on, and note that this sequence will eventually terminate with $0$. Hence we can write

$$c = (c - c') + (c' - c'') + \ldots .$$

Note also that by construction, each element of $C_{n,x}$ has norm at most $2^{-n+2}$.
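This together with inequality (4) further allows us to write for each $x$ and $\alpha$

$$\sup_k |f_k^{(\alpha)}(x)| \le \sum_{n > -\log_2(d(x))} \sup_{c_n \in C_{n,x}} |c_n^{(\alpha)}| \lesssim \sum_{n > -\log_2(d(x))} \min\Big( 2^{-n} \epsilon, \Big( \sum_{c_n \in C_{n,x}} |c_n^{(\alpha)}|^2 \Big)^{1/2} \Big).$$

Summing over $\alpha$ we get

$$\sum_{\alpha} \big( \sup_k |f_k^{(\alpha)}(x)| \big)^2 \lesssim \sum_{n > -\log_2(d(x))} \min\Big( 2^{-2n} N, \sum_{c_n \in C_{n,x}} |c_n|_H^2 \Big) \lesssim \sum_{n > -\log_2(d(x))} 2^{-2n} \min(N, N_{2^{-n}}(x)).$$

Taking finally the $L^2$ norm in $x$ gives

$$\Big\| \big\| \sup_k |f_k^{(\alpha)}(x)| \big\|_{l^2(A)} \Big\|^2_{L^2_x} \lesssim \int \sum_{2^{-n} \le d(x)/N^{1/2}} 2^{-2n} N\, dx + \int \sum_{d(x)/N^{1/2} < 2^{-n} \le d(x)} 2^{-2n} N_{2^{-n}}(x)\, dx$$

$$\lesssim \int d^2(x)\, dx + \int \sum_{d(x)/N^{1/2} < 2^{-n}} 2^{-(2-r)n}\, 2^{-rn} N_{2^{-n}}(x)\, dx$$

$$\lesssim \int d^2(x)\, dx + N^{r/2 - 1} \int d(x)^{2-r} \sup_n 2^{-rn} N_{2^{-n}}(x)\, dx$$

$$\lesssim N^{r/2 - 1} \int \|f_k(x)\|^2_{V^r(k)}\, dx$$

$$\lesssim N^{r/2 - 1} F^2.$$

This finishes the proof. □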
Corollary 4.3. Let $2 < r < \infty$. Assume we are given a set $\Xi \subset \mathbb{R}_+$ of cardinality $N \ge 1$ and assume that there is no dyadic interval of length 1 which contains more than one point in $\Xi$. For every $k \ge 0$ define $\Omega_k$ to be the collection of all dyadic intervals of length $2^{-k}$ which have nonempty intersection with $\Xi$. For each $\omega \in \Omega_k$ let $\epsilon_\omega$ be a number so that for every nested sequence of intervals $\omega_k \in \Omega_k$ we have

$$\|\epsilon_{\omega_k}\|_{V^r(k)} \le \sigma. \quad (6)$$

Define

$$\Delta_k f(x) = \Big( \sum_{\omega \in \Omega_k} \epsilon_\omega 1_\omega \hat f \Big)^\vee(x).$$

Then

$$\big\| \sup_{k \ge 0} |\Delta_k f| \big\|_2 \lesssim_r \sigma N^{r/4 - 1/2} \|f\|_2.$$

Proof. Fix $f \in L^2(\mathbb{R}_+)$. For each $k \ge 0$ we will denote by $\omega_{\xi,k}$ the unique dyadic interval in $\Omega_k$ such that $\xi \in \omega_{\xi,k}$, and by $w_\xi(x) = e(x \otimes \xi)$. Let $H$ be the $N$-dimensional Hilbert space $l^2(\Xi)$. Define the sequence of functions $f_k : \mathbb{R} \to H$, $k \ge 0$, by

$$(f_k(x))_\xi = \epsilon_{\omega_{\xi,k}} (\hat f\, 1_{\omega_{\xi,k}})^\vee(x)$$

and note that

$$(f_k(x))_\xi = w_\xi(y) (f_k(x \oplus y))_\xi \quad (7)$$

for all $y \in [0, 1)$.

To construct the vectors $e^{(\alpha)}$, choose some small negative integer $m$ so that all $w_\xi$ are constant on dyadic subintervals of $[0, 1)$ of length $2^m$. We write $w_\xi(J)$ for this constant value on such an interval $J$. For each such interval $J_\alpha$, $\alpha \in A := \{1, 2, \ldots, 2^{-m}\}$, define

$$e^{(\alpha)} = (2^{m/2} w_\xi(J_\alpha))_{\xi \in \Xi}.$$

The corresponding linear functionals are of norm $\epsilon = 2^{m/2} |\Xi|^{1/2}$. We also have

$$\sum_{\alpha} |g^{(\alpha)}|^2 \le \int_0^1 \Big| \sum_{\xi} g_\xi w_\xi(y) \Big|^2 dy \le \sum_\xi |g_\xi|^2,$$

for each $g \in H$. In the last inequality we have used that the functions $w_\xi$ are orthogonal on $[0, 1)$. Hence the functionals satisfy the assumption of Proposition 4.2 with $N = \epsilon^2 |A| = |\Xi|$. We observe the following

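$$\Big\| \big\| \sup_k |f_k^{(\alpha)}(x)| \big\|_{l^2(A)} \Big\|^2_{L^2_x} = \int_{\mathbb{R}_+} \int_0^1 \sup_k \Big| \sum_{\xi \in \Xi} \epsilon_{\omega_{\xi,k}} w_\xi(y) (\hat f\, 1_{\omega_{\xi,k}})^\vee(x) \Big|^2 dy\, dx$$

$$= \int_{\mathbb{R}_+} \int_0^1 \sup_k \Big| \sum_{\xi \in \Xi} \epsilon_{\omega_{\xi,k}} (\hat f\, 1_{\omega_{\xi,k}})^\vee(x \oplus y) \Big|^2 dy\, dx$$

$$= \int_{\mathbb{R}_+} \sup_k \Big| \sum_{\xi \in \Xi} \epsilon_{\omega_{\xi,k}} (\hat f\, 1_{\omega_{\xi,k}})^\vee(x) \Big|^2 dx$$

$$= \big\| \sup_{k \ge 0} |\Delta_k f| \big\|_2^2$$

where the last equality is a consequence of (7). The corollary now follows from Proposition 4.2 once we verify that

$$\big\| \|f_k(x)\|_{V^r(k)} \big\|_{L^2_x} \lesssim_r \sigma \|f\|_2.$$

Note that for each $x$, $f_k(x) = a_{k,x} * b_{k,x}$, where $(a_{k,x})_\xi = \epsilon_{\omega_{\xi,k}}$ and $(b_{k,x})_\xi = w_\xi(x) E(f w_\xi | \mathcal{D}_k)(x)$. The above estimate is now a consequence of Lemma 2.2, Lemma 4.1 and inequality (6). □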

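An argument very similar to the above also proves the following version of Corollary 4.3:

Corollary 4.4. Consider a collection $\Omega$ of $N$ disjoint dyadic intervals $\omega \subset \mathbb{R}_+$. For each $\omega \in \Omega$ and each $k \in \mathbb{Z}$ let $\epsilon_{k,\omega} \in \mathbb{R}$. Define

$$\Delta_k f(x) := \sum_{\omega \in \Omega} \epsilon_{k,\omega} (\hat f\, 1_\omega)^\vee(x).$$

Then for each $r > 2$

$$\big\| \sup_k |\Delta_k f| \big\|_{L^2} \lesssim_r N^{r/4 - 1/2} \sup_{\omega \in \Omega} \|\epsilon_{k,\omega}\|_{V^r(k)} \|f\|_{L^2}.$$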

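It turns out that the results of Corollaries 4.3 and 4.4 are not general enough for our applications, and so we prove the following more general version. Consider now an arbitrary set $\Xi = \{\xi_1, \ldots, \xi_N\}$ with no further restrictions on it, and for each $k \in \mathbb{Z}$ define $\Omega_k$ to be the set of all dyadic intervals of length $2^{-k}$ which contain some element of $\Xi$. We now associate to each $\omega \in \bigcup_k \Omega_k$ a number $\epsilon_\omega \in \mathbb{R}$ and define

$$\Delta_k f(x) := \sum_{\omega \in \Omega_k} \epsilon_\omega (1_\omega \hat f)^\vee(x). \quad (8)$$

Proposition 4.5. For each $r > 2$ we have the inequality

$$\big\| \sup_k |\Delta_k f| \big\|_2 \lesssim_r N^{r/4 - 1/2} \sigma \|f\|_2,$$

where

$$\sigma = \sup \|\epsilon_{\omega_k}\|_{V^r(k)},$$

the supremum being taken over all nested sequences of intervals $\omega_k \in \Omega_k$.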

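Proof. It suffices as before to assume that the index $k$ runs through a finite interval $\{a, a+1, \ldots, b\}$ with $a, b \in \mathbb{Z}$. We can find a sequence $a = k_0 < k_1 < \ldots < k_L = b$ with $L \le N$, such that for each $0 \le j \le L - 1$, $\Omega_k$ has the same cardinality when $k_j < k \le k_{j+1}$. If

$$f_j := \Big( \Big( \sum_{\omega \in \Omega_{k_j}} 1_\omega - \sum_{\omega \in \Omega_{k_{j+1}}} 1_\omega \Big) \hat f \Big)^\vee,$$

then the functions $f_j$ are pairwise orthogonal. We can now bound $\| \sup_k |\Delta_k f| \|_2$ by

$$\Big\| \sup_j \sup_{k_j < k \le k_{j+1}} \Big| \Big( \sum_{\omega \in \Omega_{k_{j+1}}} \epsilon_{\omega^{(k)}} 1_\omega \sum_{j' > j} \hat f_{j'} \Big)^\vee \Big| \Big\|_2 \quad (9)$$

$$+ \Big\| \sup_j \sup_{k_j < k \le k_{j+1}} \Big| \Big( \sum_{\omega \in \Omega_k} \epsilon_\omega 1_\omega \hat f_j \Big)^\vee \Big| \Big\|_2. \quad (10)$$

For each $\omega \in \Omega_{k_{j+1}}$ and each $k_j < k \le k_{j+1}$, $\omega^{(k)}$ is defined to be the interval in $\Omega_k$ containing $\omega$. Corollary 4.3 and scaling invariance show that the term (10) can be bounded by

$$\Big( \sum_j \Big\| \sup_{k_j < k \le k_{j+1}} \Big| \Big( \sum_{\omega \in \Omega_k} \epsilon_\omega 1_\omega \hat f_j \Big)^\vee \Big| \Big\|_2^2 \Big)^{1/2} \lesssim \Big( \sum_j N^{r/2 - 1} \sup \|\epsilon_{\omega_k}\|^2_{V^r(k)} \|f_j\|_2^2 \Big)^{1/2} \lesssim N^{r/4 - 1/2} \sigma \|f\|_2.$$

To estimate the term in (9), define the maximal operators

$$O_j^*(g) := \sup_{k_j < k \le k_{j+1}} \Big| \Big( \sum_{\omega \in \Omega_{k_{j+1}}} \epsilon_{\omega^{(k)}} 1_\omega \hat g \Big)^\vee \Big|.$$

We will argue that

$$\Big\| \sup_j O_j^*\Big( \sum_{j' > j} f_{j'} \Big) \Big\|_2 \lesssim \sigma N^{r/4 - 1/2} \Big( \sum_j \|f_j\|_2^2 \Big)^{1/2}.$$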

It suffices to consider only dyadic values of $N$ so we will assume that $N = 2^M$, for some $M \ge 0$. For each $0 \le m \le M$, denote by $A_m$ the best constant for which the following inequality holds for all discrete dyadic intervals $J = (j_1, j_2] := \{j_1 + 1, j_1 + 2, \ldots, j_2\}$ [1] $\subseteq \{1, 2, \ldots, 2^M\}$ with $2^m$ elements

$$\Big\| \sup_{j \in J} O_j^*\Big( \sum_{j < j' \le j_2} f_{j'} \Big) \Big\|_2 \le A_m \Big( \sum_{j' \in J} \|f_{j'}\|_2^2 \Big)^{1/2}.$$

We will use a reasoning similar to the one in the proof of the Rademacher-Menshov inequality, to argue that $A_M \lesssim B_M$, where

$$B_m := \sigma 2^{m(r/4 - 1/2)}.$$

We can write for each $0 \le m \le M - 1$ and each discrete dyadic interval $J = (j_1, j_2] \subseteq \{1, 2, \ldots, 2^M\}$ having $2^{m+1}$ elements and midpoint $j_3 := j_1 + 2^m$

$$\Big\| \sup_{j_1 < j \le j_2} O_j^*\Big( \sum_{j < j' \le j_2} f_{j'} \Big) \Big\|_2^2 \le \Big\| \sup_{j_3 + 1 \le j \le j_2} O_j^*\Big( \sum_{j < j' \le j_2} f_{j'} \Big) \Big\|_2^2$$

$$+ \Big( \Big\| \sup_{j_1 + 1 \le j \le j_3} O_j^*\Big( \sum_{j < j' \le j_3} f_{j'} \Big) \Big\|_2 + \Big\| \sup_{j_1 + 1 \le j \le j_3} O_j^*\Big( \sum_{j_3 + 1 \le j' \le j_2} f_{j'} \Big) \Big\|_2 \Big)^2.$$

We then use the definition of $A_m$ for the first two terms above and Corollary 4.4 for the third one, to bound the sum above by

$$A_m^2 \sum_{j_3 + 1 \le j' \le j_2} \|f_{j'}\|_2^2 + \Big( A_m \Big( \sum_{j_1 < j' \le j_3} \|f_{j'}\|_2^2 \Big)^{1/2} + C B_m \Big( \sum_{j_3 + 1 \le j' \le j_2} \|f_{j'}\|_2^2 \Big)^{1/2} \Big)^2 \le (A_m + C B_m)^2 \sum_{j' \in J} \|f_{j'}\|_2^2.$$

We conclude that $A_{m+1} \le A_m + C B_m$ for each $0 \le m \le M - 1$, which together with the fact that $A_0 = 0$ proves that $A_M \lesssim B_M$. □

[1] $j_1$ and $j_2$ are of the form $a 2^b$ with $a, b \in \mathbb{Z}_+$.
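The last step sums a geometric series: iterating $A_{m+1} \le A_m + C B_m$ from $A_0 = 0$ with $B_m = \sigma 2^{m(r/4 - 1/2)}$ gives $A_M \le C \sum_{m < M} B_m$. The sketch below evaluates the worst case of this recursion and compares it with the closed form; the constant `C`, the value of `sigma` and the function names are placeholders chosen for illustration, not quantities fixed by the text.

```python
def iterate_bound(M: int, r: float, sigma: float = 1.0, C: float = 1.0) -> float:
    """Worst case of A_{m+1} <= A_m + C * B_m with A_0 = 0 and
    B_m = sigma * 2**(m * (r/4 - 1/2)), obtained by taking equality."""
    delta = r / 4.0 - 0.5
    A = 0.0
    for m in range(M):
        A += C * sigma * 2.0 ** (m * delta)
    return A


def closed_form(M: int, r: float, sigma: float = 1.0, C: float = 1.0) -> float:
    """Sum of the geometric series C * sigma * sum_{m < M} 2**(m * delta)."""
    delta = r / 4.0 - 0.5
    return C * sigma * (2.0 ** (M * delta) - 1.0) / (2.0 ** delta - 1.0)


def final_level(M: int, r: float, sigma: float = 1.0) -> float:
    """B_M = sigma * 2**(M * (r/4 - 1/2)), which equals sigma * N**(r/4 - 1/2) for N = 2**M."""
    return sigma * 2.0 ** (M * (r / 4.0 - 0.5))
```

In this worst case $A_M \le C B_M / (2^{\delta} - 1)$ with $\delta = r/4 - 1/2$, so the implicit constant in $A_M \lesssim B_M$ depends on $r$ and grows as $r$ approaches 2.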

Remark 4.6. If in the above proposition we choose $\epsilon_\omega = 1$ for each $\omega$, we recover the result of Bourgain from [1], with a slightly larger dependence on $N$ of the bound. While Bourgain's bound is logarithmic in $N$, a bound of the form $N^{r/4 - 1/2}$ will suffice for our later applications, since we can afford to take $r$ as close to 2 as we want.

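5. Pointwise estimates outside exceptional sets

5.1. An estimate for a collection of 2-trees. Assume we have a collection $S' \subseteq S$ of bitiles which can be written as a not necessarily disjoint union of 2-trees

$$S' = \bigcup_{T \in \mathcal{F}} T.$$

We shall assume that if $T \in \mathcal{F}$, then $T$ is indeed the maximal 2-tree in $S'$ with the top $(I_T, \xi_T)$, that is, all bitiles $P \in S'$ which satisfy $I_P \subseteq I_T$ and $\xi_T \in \omega_{P,2}$ are in $T$.

Theorem 5.1. For each $\beta \ge 1$, $\gamma > 0$ and each $(a_P)_{P \in S'}$ define the exceptional sets

$$E^{(1)} = \Big\{x : \sum_{T \in \mathcal{F}} 1_{I_T}(x) > \beta \Big\},$$

$$E^{(2)} = \bigcup_{T \in \mathcal{F}} \Big\{x : \Big\| \sum_{P \in T,\ |I_P| < 2^k} a_P w_{P_1}(x) \Big\|_{V^r(k)} > \gamma \Big\}.$$

Then for each $x \notin E^{(1)} \cup E^{(2)}$ and each $r > 2$ we have

$$\Big\| \Big( \sum_{P \in S',\ |I_P| < 2^k} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)} \lesssim_r \gamma \beta^{r/4 - 1/2}.$$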

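Proof. Fix $x$ not in the union of the exceptional sets and let $\mathcal{F}_x$ be the family of all trees $T \in \mathcal{F}$ with $x \in I_T$. Define

$$\Xi_x := \{\xi_T : T \in \mathcal{F}_x\}.$$

For each $k \in \mathbb{Z}$ let $\Omega_k$ be the collection of dyadic frequency intervals of length $2^{-k}$ which contain an element of $\Xi_x$. Let $\tilde\Omega_k$ be the collection of all children of intervals in $\Omega_{k-1}$ that are not themselves in $\Omega_k$. Observe that both $\bigcup_{k'} \tilde\Omega_{k'}$ and $\Omega_k \cup \bigcup_{k' \le k} \tilde\Omega_{k'}$ are collections of pairwise disjoint intervals which cover $\mathbb{R}_+$ (with the possible exception of finitely many dyadic points). Moreover we can write

$$\sum_{P \in S' : |I_P| < 2^k} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) = \sum_{\omega \in \Omega_k} 1_\omega(\theta) \sum_{\substack{P \in S' \\ |I_P| < 2^k,\ \omega \cap \omega_{P,2} \neq \emptyset}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta)$$

$$+ \sum_{k' \le k} \sum_{\omega \in \tilde\Omega_{k'}} 1_\omega(\theta) \sum_{\substack{P \in S' \\ |I_P| < 2^k,\ \omega \cap \omega_{P,2} \neq \emptyset}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta).$$

Indeed, if $1_\omega(\theta) w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \neq 0$ for some $\omega \in \Omega_k \cup \bigcup_{k' \le k} \tilde\Omega_{k'}$ and $P \in S'$, then this implies that $\omega \cap \omega_{P,2} \neq \emptyset$. Moreover, when $\omega \in \Omega_k$, this latter restriction together with $|I_P| < 2^k$ is equivalent with just asking that $\omega \subseteq \omega_{P,2}$. Similarly, when $\omega \in \bigcup_{k' \le k} \tilde\Omega_{k'}$ then $\omega_{P,2} \subseteq \omega$ is impossible, which in turn makes the requirement $|I_P| < 2^k$ superfluous. Indeed $\omega_{P,2} \subseteq \omega$ would imply that $\omega_P \subseteq \omega$, contradicting the fact that $\omega_P$ contains an element from $\Xi_x$ while $\omega$ does not. Hence we can rewrite

$$\sum_{P \in S' : |I_P| < 2^k} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) = \sum_{\omega \in \Omega_k} 1_\omega(\theta) \sum_{\substack{P \in S' \\ \omega \subseteq \omega_{P,2}}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \quad (11)$$

$$+ \sum_{k' \le k} \sum_{\omega \in \tilde\Omega_{k'}} 1_\omega(\theta) \sum_{\substack{P \in S' \\ \omega \subseteq \omega_{P,2}}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta). \quad (12)$$

The multiplier in (12) can be written more conveniently as

$$\Big( 1 - \sum_{\omega \in \Omega_k} 1_\omega(\theta) \Big) \Big( \sum_{k'} \sum_{\omega \in \tilde\Omega_{k'}} 1_\omega(\theta) \sum_{\substack{P \in S' \\ \omega \subseteq \omega_{P,2}}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big) = \Big( 1 - \sum_{\omega \in \Omega_k} 1_\omega(\theta) \Big) \sum_{P \in S'} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta),$$

given the fact that $(\bigcup_{\omega \in \Omega_k} \omega)^c = \bigcup_{k' \le k} \bigcup_{\omega \in \tilde\Omega_{k'}} \omega$ and $(\bigcup_{k' \le k} \bigcup_{\omega \in \tilde\Omega_{k'}} \omega) \cap (\bigcup_{k' > k} \bigcup_{\omega \in \tilde\Omega_{k'}} \omega) = \emptyset$, modulo some dyadic points. This maximal multiplier operator is now easily seen to be the composition of two operators. One is the identity minus Bourgain's maximal operator for which Proposition 4.5 provides good bounds. The second one is a linear operator associated with the multiplier $\sum_{P \in S'} a_P w_{P_1}(x) 1_{\omega_{P,2}}$. To analyze the latter operator, we note that for each $\theta$ the contribution to the multiplier comes from a single tree. To see this note that the collection

$$A := \{P \in S' : x \in I_P,\ \theta \in \omega_{P,2}\}$$

is finite and totally ordered and hence it contains a maximum element $P_\theta$. If $T_\theta \in \mathcal{F}$ is one of the 2-trees to which $P_\theta$ belongs, then by the maximality condition in the hypothesis it follows that $P \in T_\theta$ for each $P \in A$. Moreover, there is some $k$ such that

$$\sum_{P \in S'} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) = \sum_{\substack{P \in T_\theta \\ |I_P| < 2^k}} a_P w_{P_1}(x).$$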
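By invoking Proposition 4.5 and the fact that $x \notin E^{(1)} \cup E^{(2)}$ we get that

$$\Big\| \Big( \Big( 1 - \sum_{\omega \in \Omega_k} 1_\omega \Big) \sum_{P \in S'} a_P w_{P_1}(x) 1_{\omega_{P,2}} \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)} \le \Big( 1 + \Big\| \Big( \sum_{\omega \in \Omega_k} 1_\omega \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*} \Big) \Big\| \sum_{P \in S'} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big\|_{L^\infty(\theta)} \lesssim \gamma \beta^{r/4 - 1/2}.$$

The term (11) is clearly of the form (8) with

$$\epsilon_\omega = \sum_{\substack{P \in S' \\ \omega \subseteq \omega_{P,2}}} a_P w_{P_1}(x).$$

We claim that for each nested sequence of intervals $\omega_k \in \Omega_k$ we have the variational norm estimate

$$\|\epsilon_{\omega_k}\|_{V^r(k)} \lesssim \gamma. \quad (13)$$

This follows immediately from the fact that all bitiles contributing to $\epsilon_{\omega_k}$ belong to a single tree $T \in \mathcal{F}$. Indeed, the collection

$$B := \{P \in S' : x \in I_P,\ \omega_k \subseteq \omega_{P,2} \text{ for some } k\}$$

is finite and totally ordered, so it has a maximum element $P_x$. If $T_x \in \mathcal{F}$ is one of the 2-trees to which $P_x$ belongs, then from the maximality condition in the hypothesis it follows that $P \in T_x$ for each $P \in B$. Moreover, for each $k$

$$\{P \in S' : \omega_k \subseteq \omega_{P,2}\} = \{P \in T_x : |I_P| < 2^k\}.$$

Thus (13) and Proposition 4.5 imply that

$$\Big\| \Big( \sum_{\omega \in \Omega_k} 1_\omega(\theta) \sum_{\substack{P \in S' \\ \omega \subseteq \omega_{P,2}}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)} \lesssim \gamma \beta^{r/4 - 1/2}.$$

□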



5.2. An estimate for a collection of 1-trees. The discussion here is very similar to that for 2-trees. Assume we have a collection $S'$ of bitiles which can be written as a not necessarily disjoint union of finitely many 1-trees

$$S' = \bigcup_{T \in \mathcal{F}} T.$$

We shall assume that for every $P \in S'$ there does not exist a tree $T \in \mathcal{F}$ with $I_P \subseteq I_T$ and $\xi_T \in \omega_{P,2}$. This assumption does in particular imply that the upper tiles $P_2$ are pairwise disjoint. For assume not, and $I_P \subseteq I_{P'}$ and $\omega_{P',2} \subseteq \omega_{P,2}$ for some $P, P'$; then it is easy to see that the upper tile $P_2$ violates the above assumption with respect to any tree to which $P'$ belongs.



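Theorem 5.2. Let $(a_P)_{P \in S'}$ satisfy

$$\sup_{P \in S'} \frac{|a_P|}{|I_P|^{1/2}} \le \sigma. \quad (14)$$

For each $\beta \ge 1$ define the exceptional set

$$E = \Big\{x : \sum_{T \in \mathcal{F}} 1_{I_T}(x) > \beta \Big\}.$$

Then for each $x \notin E$ and each $r > 2$ we have

$$\Big\| \Big( \sum_{P \in S',\ |I_P| < 2^k} a_P w_{P_1}(x) 1_{\omega_{P,2}} \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*} \lesssim_r \sigma \beta^{r/4 - 1/2}.$$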

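Proof. As before we write

$$\sum_{P \in S' : |I_P| < 2^k} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) = \sum_{\omega \in \Omega_k} 1_\omega(\theta) \sum_{\substack{P \in S' \\ \omega \subseteq \omega_{P,2}}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) + \sum_{k' \le k} \sum_{\omega \in \tilde\Omega_{k'}} 1_\omega(\theta) \sum_{\substack{P \in S' \\ \omega \subseteq \omega_{P,2}}} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta).$$

The argument continues as in the previous section. Since the upper tiles $P_2$ are pairwise disjoint, the collections $A$ and $B$ contain at most one bitile. This observation together with (14) implies that

$$\Big\| \sum_{P \in S'} a_P w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big\|_{L^\infty(\theta)} \le \sigma$$

and

$$\|\epsilon_{\omega_k}\|_{V^r(k)} \lesssim_r \sigma.$$

An application of Proposition 4.5 ends the proof. □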

5.3. Arbitrary collection of trees. Let $S'$ be an arbitrary collection of bitiles which can be written as a not necessarily disjoint union of finitely many trees

$$S' = \bigcup_{T \in \mathcal{F}} T.$$

We next show that $S'$ can be split into a collection of 2-trees like in Section 5.1 and a collection of 1-trees like in Section 5.2.

For each $T \in \mathcal{F}$ let $T^{(2)}$ be the collection of all bitiles $P \in S'$ such that $I_P \subseteq I_T$ and $\xi_T \in \omega_{P,2}$. If $S^{(2)}$ denotes the union of all trees $T^{(2)}$, then $S^{(2)}$ qualifies as a collection of trees as in Section 5.1.

For each $T \in \mathcal{F}$ let $T^{(1)}$ be the collection of all bitiles $P \in S' \setminus S^{(2)}$ such that $I_P \subseteq I_T$ and $\xi_T \in \omega_{P,1}$. If $S^{(1)}$ denotes the union of all trees $T^{(1)}$, then $S^{(1)}$ qualifies as a collection of trees as in Section 5.2. The additional geometric assumption is satisfied since we have exhausted all 2-trees first.

We will denote by $\mathcal{F}^{(1)}$ and $\mathcal{F}^{(2)}$ respectively the two families of trees that arise from the above procedure. An immediate consequence of the results in the previous two subsections is the following theorem.



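Theorem 5.3. Let $(a_P)_{P \in S'}$ satisfy

$$\sup_{P \in S'} \frac{|a_P|}{|I_P|^{1/2}} \le \sigma.$$

For each $\beta \ge 1$ and $\gamma > 0$ define the exceptional sets

$$E^{(1)} = \Big\{x : \sum_{T \in \mathcal{F}} 1_{I_T}(x) > \beta \Big\},$$

$$E^{(2)} = \bigcup_{T \in \mathcal{F}^{(2)}} \Big\{x : \Big\| \sum_{P \in T,\ |I_P| < 2^k} a_P w_{P_1}(x) \Big\|_{V^r(k)} > \gamma \Big\}.$$

Then for each $x \notin E^{(1)} \cup E^{(2)}$ and each $r > 2$ we have

$$\Big\| \Big( \sum_{P \in S',\ |I_P| < 2^k} a_P w_{P_1}(x) 1_{\omega_{P,2}} \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*} \lesssim_r (\sigma + \gamma) \beta^{r/4 - 1/2}.$$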



6. Main argument 



In this section we present the proof of Theorem 1.3. For each collection of bitiles $S' \subseteq S$ define the following operator.

$$V_{S'} f(x) := \Big\| \Big( \sum_{P \in S',\ |I_P| < 2^k} \langle f, w_{P_1} \rangle w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)}.$$

Note that for each $S'$ the operator $V_{S'}$ is sublinear as a function of $f$. Also, for each $f$ and $x$ the mapping $S' \to V_{S'} f(x)$ is sublinear as a function of the bitile set $S'$. We will prove in the following that

$$m\{x : V_S 1_F(x) > \lambda\} \lesssim_p \frac{|F|}{\lambda^p} \quad (15)$$

for each $F \subset \mathbb{R}_+$ of finite measure, each $\lambda > 0$ and each $1 < p < \infty$. Then, by invoking the Marcinkiewicz interpolation theorem and restricted weak type interpolation we get for each $1 < p < \infty$ that

$$\|V_S f\|_p \lesssim_p \|f\|_p.$$

Fix $F$ and $\lambda$. We first prove (15) in the case $\lambda \le 1$. Define the first exceptional set

$$E := \{x : M_p 1_F(x) > \lambda\}$$

and note that $|E| \lesssim \lambda^{-p} |F|$. Since the range of $p$ is open, it thus suffices to prove that for each $\epsilon > 0$



m{xER: V Sl l F (x) > < e , P 



\F\ 



(16) 
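where

$$S_1 = \{P \in S : I_P \cap E^c \neq \emptyset\}.$$

Proposition 3.4 guarantees that $\mathrm{size}(S_1) \lesssim \lambda$, where the size is understood here with respect to the function $1_F$. Define $\Delta := [-\log_2(\mathrm{size}(S_1))]$. Use the result of Proposition 3.5 to split $S_1$ as a disjoint union $S_1 = \bigcup_{n \ge \Delta} \mathcal{P}_n$, where $\mathrm{size}(\mathcal{P}_n) \le 2^{-n}$ and each $\mathcal{P}_n$ consists of a family $\mathcal{F}_{\mathcal{P}_n}$ of trees satisfying

$$\sum_{T \in \mathcal{F}_{\mathcal{P}_n}} |I_T| \lesssim 2^{2n} |F|. \quad (17)$$

Let $\epsilon > 0$ be an arbitrary positive number. For each $n \ge \Delta$ define $\sigma := 2^{-n}$, $\beta := 2^{3n} \lambda^p$, $\gamma := 2^{-n/2} \lambda^{1/2 - \epsilon}$. Define $a_P := \langle 1_F, w_{P_1} \rangle$ for each $P \in \mathcal{P}_n$ and note that the collection $\mathcal{P}_n$ together with the coefficients $(a_P)_{P \in \mathcal{P}_n}$ satisfy the requirements of Theorem 5.3. Let $\mathcal{F}^{(2)}_{\mathcal{P}_n}$ be the collection of all the 2-trees $T^{(2)}$ obtained from the trees $T \in \mathcal{F}_{\mathcal{P}_n}$ by the procedure described in the beginning of Section 5.3. Define the corresponding exceptional sets

$$E_n^{(1)} = \Big\{x : \sum_{T \in \mathcal{F}_{\mathcal{P}_n}} 1_{I_T}(x) > \beta \Big\},$$

$$E_n^{(2)} = \bigcup_{T \in \mathcal{F}^{(2)}_{\mathcal{P}_n}} \Big\{x : \Big\| \sum_{P \in T,\ |I_P| < 2^k} a_P w_{P_1}(x) \Big\|_{V^r(k)} > \gamma \Big\}.$$

By (17) and the fact that $\lambda \le 1$ we get

$$|E_n^{(1)}| \lesssim 2^{-n} \lambda^{-p} |F|.$$

By Theorem 3.7 and the fact that $\lambda \le 1$, for each $1 < s < \infty$ we get

$$|E_n^{(2)}| \lesssim \gamma^{-s} \sigma^{s-2} |F| \le 2^{-n(s/2 - 2)} \lambda^{-s(1/2 - \epsilon)} |F|.$$

Define

$$E^* := \bigcup_{n \ge \Delta} (E_n^{(1)} \cup E_n^{(2)}).$$

Note that since $\Delta \gtrsim \log_2(\lambda^{-1})$, we have $|E^*| \lesssim \lambda^{-p} |F|$, an estimate which can be seen by using a sufficiently large $s$.

For each $x \notin E^*$, Theorem 5.3 guarantees that

$$\Big\| \Big( \sum_{P \in S_1,\ |I_P| < 2^k} \langle 1_F, w_{P_1} \rangle w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)} \le \sum_{n \ge \Delta} \Big\| \Big( \sum_{P \in \mathcal{P}_n,\ |I_P| < 2^k} \langle 1_F, w_{P_1} \rangle w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)}$$

$$\lesssim \sum_{n \ge \Delta} n \Big[ 2^{(3(r/2 - 1) - 1)n} \lambda^{p(r/2 - 1)} + 2^{(3(r/2 - 1) - 1/2)n} \lambda^{p(r/2 - 1) + 1/2 - \epsilon} \Big]$$

if $r$ is chosen sufficiently close to 2, depending on $p$ and $\epsilon$. This ends the proof of (16), and hence the proof of (15) in the case $\lambda \le 1$.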

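We next focus on proving (15) in the case $\lambda > 1$. In this remaining part of the discussion the size will be understood with respect to the function $\lambda^{-1} 1_F$. Proposition 3.4 implies that $\mathrm{size}(S) \lesssim \lambda^{-1}$. Define $\Delta := [-\log_2(\mathrm{size}(S))]$. Split $S$ as before, as a disjoint union $S = \bigcup_{n \ge \Delta} \mathcal{P}_n$, where $\mathrm{size}(\mathcal{P}_n) \le 2^{-n}$ and each $\mathcal{P}_n$ consists of a family $\mathcal{F}_{\mathcal{P}_n}$ of trees satisfying

$$\sum_{T \in \mathcal{F}_{\mathcal{P}_n}} |I_T| \lesssim 2^{2n} \lambda^{-2} |F|. \quad (18)$$

For each $n \ge \Delta$ define $\sigma := 2^{-n}$, $\beta := 2^{(p+1)n} \ge 1$ and $\gamma := 2^{-n/2}$. Define also $a_P := \langle \lambda^{-1} 1_F, w_{P_1} \rangle$ for each $P \in \mathcal{P}_n$ and note that the collection $\mathcal{P}_n$ together with the coefficients $(a_P)_{P \in \mathcal{P}_n}$ satisfy the requirements of Theorem 5.3. Let $\mathcal{F}^{(2)}_{\mathcal{P}_n}$ be the collection of all the 2-trees $T^{(2)}$ obtained from the trees $T \in \mathcal{F}_{\mathcal{P}_n}$ by the procedure described in the beginning of Section 5.3. Define the corresponding exceptional sets

$$E_n^{(1)} = \Big\{x : \sum_{T \in \mathcal{F}_{\mathcal{P}_n}} 1_{I_T}(x) > \beta \Big\},$$

$$E_n^{(2)} = \bigcup_{T \in \mathcal{F}^{(2)}_{\mathcal{P}_n}} \Big\{x : \Big\| \sum_{P \in T,\ |I_P| < 2^k} a_P w_{P_1}(x) \Big\|_{V^r(k)} > \gamma \Big\}.$$

By (18) and the fact that $\lambda > 1$ we get

$$|E_n^{(1)}| \lesssim 2^{-(p-1)n} \lambda^{-2} |F|.$$

By Theorem 3.7 and the fact that $\lambda > 1$, for each $1 < s < \infty$ we get

$$|E_n^{(2)}| \lesssim \gamma^{-s} \sigma^{s-2} \lambda^{-2} |F| \le 2^{-n(s/2 - 2)} \lambda^{-2} |F|.$$

Define

$$E^* := \bigcup_{n \ge \Delta} (E_n^{(1)} \cup E_n^{(2)}).$$

Note that since $\Delta \gtrsim \log_2(\lambda)$, we have $|E^*| \lesssim \lambda^{-p} |F|$, an estimate which can be seen by using a sufficiently large $s$.

For each $x \notin E^*$, Theorem 5.3 guarantees that

$$\Big\| \Big( \sum_{P \in S,\ |I_P| < 2^k} \langle \lambda^{-1} 1_F, w_{P_1} \rangle w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)} \le \sum_{n \ge \Delta} \Big\| \Big( \sum_{P \in \mathcal{P}_n,\ |I_P| < 2^k} \langle \lambda^{-1} 1_F, w_{P_1} \rangle w_{P_1}(x) 1_{\omega_{P,2}}(\theta) \Big)_{k \in \mathbb{Z}} \Big\|_{M_2^*(\theta)}$$

$$\lesssim \sum_{n \ge \Delta} n\, 2^{(p+1)(r/2 - 1)n} (2^{-n} + 2^{-n/2})$$

$$\lesssim 1,$$

if $r$ is chosen sufficiently close to 2, depending only on $p$. This ends the proof of (15) in the case $\lambda > 1$.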